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We report the complete solid-state MAS NMR resonance assignment of a medium-sized, trimeric 
membrane protein, YadA-M. The protein YadA ( Yersinia adhesin A) is an important virulence factor of 
enteropathogenic Yersinia species (such as Yersinia enterocolitica and Yersinia pseudotuberculosis). YadA 
is localized on the bacterial cell surface and is involved in adhesion to host cells and tissues. It is anchored in 
the outer membrane by a transmembrane anchor domain (YadA-M). This domain hosts the so-called 
auto transporter function of YadA: it transports its own N-terminal domain through the outer membrane. 
The assignment is based on a dataset that consisted of several MAS NMR correlation spectra, recorded on a 
single, uniformly ^^C, ^^N- labelled microcrystalline preparation. Except for the single C-terminal residue 
and the mobile strep tag, we were able to completely assign YadA-M. From this, secondary structure 
elements were predicted, which, combined with several long-range interstrand restraints, yielded the 
architecture of the p- sheet. 

YadA is a virulence factor found in the outer membrane of pathogenic Yersinia species, such as Yersinia 
enterocolitica and Yersinia pseudotuberculosis^ . This membrane protein is a prototypical member of a 
group of non-fimbrial, non-pilus adhesins, named trimeric autotransporter adhesins (TAAs)^. Its trans- 
membrane anchor domain (hereafter called YadA-M) forms a highly stable trimeric P-barreP. Upon membrane 
insertion which is dependent on the Bam machinery^, the YadA anchor domain functions as an autotransporter: 
it transports its own N-terminal domain through the membrane, a process that can be slowed down and even 
inhibited by single point mutations in the transport pore^. After completion of this transport, the three N-termini 
build a stalk with a sticky head group that can adhere to extracellular matrix components such as collagen on the 
host cell surface^. This adherence is one of the first steps in the infection pathway, resulting in several enteric food- 
borne diseases in the case of Yersinia, ranging from enterocolitis, acute enteritis, diarrhoea, mesenteric lympha- 
denitis, septicaemia, to autoimmune disorders such as reactive arthritis \ The autotransport mechanism is 
currently poorly understood; knowledge about the structure and dynamics of the domains involved in the 
autotransport will contribute to a better understanding of this process and will shed light on the pathogenic 
function of TAAs in Yersiniae as well as in other species. 

Membrane proteins are generally difficult to purify and to crystallize, which renders them difficult to be studied 
with well-established techniques for structure determination, such as X-ray crystallography and solution NMR 
spectroscopy^. Indeed, attempts to prepare high-quality single crystals of YadA-M diffracting well enough for 
structure studies with X-ray crystallography were thus far unsuccessfuP. However, during our efforts to optimize 
crystallization conditions, protocols were found to prepare microcrystalline material in large amounts. Such 
'messy' crystals are highly suitable for structure studies with solid-state magic-angle spinning (MAS) NMR, since 
this technique neither relies on availability of high-quality crystals, nor on rapid molecular tumbling. In fact, the 
high degree of structural order of the bulk material within the microcrystals aids to reduce inhomogeneous 
broadening of the NMR lines^. In this work, we report on the ^^C and ^^N chemical shift assignment of the 
membrane anchor domain of YadA by solid-state MAS NMR. From the chemical shifts, secondary structure 
elements were predicted, which, in combination with long range magnetization transfers, yielded the topology of 
the beta-sheet. A full structure calculation is presented in a separate paper^. The YadA-M construct used in these 
studies consists of the trimeric membrane anchor domain and the first part of the stalk, and contains 105 amino- 
acid residues per monomer. We were able to sequentially assign YadA-M almost completely, using a single, 
uniformly ^^C, ^^N labelled micro -crystallized sample made from detergent-extracted protein as described in^ 
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The chemical- shift assignment was an essential first step towards the 
structure calculation of YadA-M^ (PDB entry 2LME). This study 
briefly encompasses the strategies and experiments used to achieve 
the complete chemical shifl; assignment of a membrane protein using 
uniformly ^^C, ^^N labelled microcrystalline material. We believe that 
the scenario in which high-quality crystals are not available and 
protocols for the preparation of micro- crystals do exist, is not unique 
for our YadA-M system; rather, it is a situation that is commonly 
encountered in structural biology and puts the strategy described in 
this paper in a broader perspective. 

Results 

Spectral quality. The homotrimer YadA-M consist of three 
protomers of 105 residues each, of which the first fifteen residues 
are part of a highly flexible strep -tag introduced to aid in protein 
purification^. Figure la shows a one- dimensional (ID)^H-^^C cross- 
polarization (CP) MAS NMR spectrum recorded from uniformly 
^^C, ^^N-labelled YadA-M. The spectrum is well-dispersed, which is 
illustrated by the many flne details in the generally poorly resolved 
carbonyl region around 172 ppm (shown as inset in Fig. la). The 
overall high dispersion is partly the result from the secondary 
structure of TAA membrane anchors, which for YadA-M is about 
half ot-helix / half P-sheet^. In addition, Fourier-transform infrared 
studies on YadA-M have shown that the main part of the protein is 
extremely rigid, leading to a very high spectral resolution also in 
FTIR^. The (micro-) crystalline environment and the overall 
rigidity of the protein will contribute to the structural homogeneity 
and hence, to a narrow line width. In contrast, the strep-tag is known 
to be highly mobile (colour- coded in grey on the primary sequence in 
Fig. lb) ^. As a result, it can be expected that the MAS NMR signals of 
residues in the strep-tag will be very weak or not observed at all. The 
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Figure 1 | ID ^^C spectrum and YadA-M sequence. The spectrum (a) is 
recorded at a field of 900 MHz, a spinning frequency of 15 kHz and at a 
temperature of 275 K, using a standard ^H-^^C cross polarization (75- 
100% ramp on ^^C)^^. The inset shows the carbonyl region. The starting 
points of the sequential assignment are colour coded on the primary 
sequence of YadA-M; residues that appear only once (HI 6, P48, M96) or 
twice (T30 and T57; 171 and 1103) are coloured red, their sequential 
neighbours blue, and unique sequential pairs green. 



presumption that all strong signals arise from residues in the rigid 
part of the protein was proven during the course of the assignment 
procedure by the self-consistency of the assignment. 

As a quality check and to monitor the integrity of the sample, ID 
spectra were recorded directly before and after any multidimensional 
experiment. During the course of the work, no indication for change 
or degradation of the sample was observed. Sample integrity was also 
monitored by following the Cs-Cy correlation of the (single) methio- 
nine in the sequence (M96), which did not show any sign of oxidation 
(See Supplementary Fig. SI online). Hence, these findings imply that 
the YadA-M trimer is remarkably stable over a longer period of time. 

Assignment strategies - Identification of spin systems. The sequence- 
specific assignment was achieved using a dataset consisting of 
several two- and three-dimensional (2D and 3D, respectively) 
homo- and heteronuclear correlation experiments (for a detailed 
description of the experiments, see the supplementary material - 
Supplement Table 1). In a first step, 2D carbon-carbon correlation 
experiments were analysed to search for 'fingerprint patterns' of the 
various types of amino acids. In a second step, sequential relation- 
ships are established by linking ^^C resonances to the backbone- 
amide ^^N using heteronuclear, multidimensional experiments. 

Figure 2 shows the aliphatic and carbonyl regions of a 2D ^^C-^^C 
homonuclear correlation spectrum of YadA-M. The correlations 
were obtained by using a DARR/RAD scheme with 25 ms mixing^°'^\ 
The spectrum is fairly well resolved and several amino-acid types can 
be directly identified upon their characteristic correlation pattern 
and unique chemical shifts, such as threonines, serines and isoleu- 
cines; other residues give fingerprint patterns in the more congested 
regions of the spectrum and are not readily identifled due to ambi- 
guity; still, in many cases, side- chain correlation patterns can be 
traced putatively and their assignment is achieved by checking for 
self-consistency in the different spectra of the dataset. Glycines do 
not have side-chains, hence, they do not provide cross peaks in the 
aliphatic region. However, they are readily identifled upon their 
unique upfleld- shifted Ca-C correlations (cf Fig. 2b). 

Using unique doublets and triplets for assignment. The identi- 
flcation of flngerprint patterns yields residue -specific, but mostly 
not sequence-speciflc assignments. However, when these patterns 
are distinctive either on basis of well- dispersed chemical shifts or 
arise from residues that are unique or appear in a very little num- 
ber, they provide good starting points for the sequential assignment. 
YadA-M has a highly repetitive primary sequence of mainly alanines 
(13), glycines (12), serines (12), leucines (8), valines (8), asparagines 
(6), aspartates (6), lysines (6), arginines (5), glutamines (5), tyrosines 
(5) and phenylalanines (5); together these residues constitute 91 out 
of 105 in total for YadA-M. We found eight unique spin systems and 
used them as starting points for sequential assignment. This is illu- 
strated in Figure lb, which shows the primary sequence of YadA-M; 
colour-coded in red are observable amino acids that appear either 
only once (HI 6, P48, M96) or twice in the sequence (T30 and T57; 
171 and 1103). From these starting points, the neighbouring residues 
can be assigned (colour-coded blue in Fig. lb). There are three 
glutamic acids in YadA-M (E12, E79 and E104). E12 resides in the 
strep -tag and is not observed. Since El 04 is identifled by its nei- 
ghbouring 1103, the remaining glutamic acid (E79) is considered 
unique and colour- coded red as well. In addition, pair- wise resi- 
dues that form unique doublets with at least one of the members 
easily identiflable are coloured green. We found 19 of such pairs, 
which additionally provided unambiguous starting points for the 
sequential assignment. 

Sequence specific assignment. For the sequence- specific assign- 
ment of the residues, it is mandatory to know the backbone ^^N 
chemical shift for each residue. For this purpose, band-selective 
heteronuclear ^^C-^^N correlation spectra are recorded. Here, the 
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Figure 2 | 2D ^^C-^^C DARK spectrum. Contour plot of a 2D DARK experiment recorded at a field of 900 MHz, a spinning frequency of 12 kHz and 
at temperature of 275 K. Magnetization between ^^C spins was exchanged by use of a 25 ms DARR mixing period. The aliphatic region (a) and carbonyl 
region (b) are shown together with chemical shift assignments of YadA-M. 



essential building block is a selective polarization transfer step 
between the amide ^^N and the ^^Ca (which is called NCA transfer) 
or the C (NCO transfer) achieved with adiabatic cross polarization 
optimized for either the NCA condition or the NCO condition^^. The 
NCA transfer is intraresidual, i.e., it connects between ^^N^ and ^^Ca^ 
of the same residue; the NCO transfer is interresidual and involves a 
crossing of the peptide bond to connect ^^N^ with ^^C'/.i. Figure 3 
shows a 2D NCA (right) and NCO spectrum (left). In the NCA 



spectrum, certain residues give rise to fingerprint signals. For in- 
stance, glycines and alanines provide easily identifiable correlations 
due to their upfield shifted ^^Ca signals, and upfield (glycine) or 
downfield (alanine) shifted ^^N signal. Valines are characterized by 
a downfield shifted ^^Ca and ^^N resonance. Serines have an upfield 
^^N signal and an intermediate ^^Ca signal (between 55-60 ppm). 
These identification 'rules' can only be applied in one direction: 
whereas it is true that in the typical glycine, alanine or valine NCA 
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Figure 3 | 2D ^"N-^'C NCA and 2D NCO spectra. Contour plots of a 2D NCO (a) and NCA spectrum (b), recorded at a field of 400 MHz, a spinning 
frequency of 8 kHz and at a temperature of 275 K. Polarization exchange between ^H and ^^N was obtained by use of a 1 ms standard cross-polarization 
(CP) period; selective transfer from ^^N to ^^C (a) or ^^Ca (b) was achieved by a 4 ms adiabatic CP^^ Chemical shift assignments for YadA-M are indicated 
in the figure. 



region no other residues are found, the other way round it is not true, 
as demonstrated by the shifts of e.g. G52, A41, A99 and V87. In the 
NCO spectrum (Fig. 3, left), carbonyl spins are correlated with 
backbone nitrogen spins. Note that in the NCO experiment no 
fingerprint patterns are found, since the correlations connect be- 
tween spins of different amino acids. The two spectra are linked by 
the ^^N frequency of the backbone amide, providing sequential 
connectivities. Several examples to illustrate this pair- wise linking 
are highlighted in red in Figure 3. 

In general, assignment walks can only be established if the NCA 
and NCO correlations are 'dressed' with the shifts of the side-chain 
spins, which is usually done by extending the NCA or NCO experi- 
ments with a non-specific ^^C-homonuclear transfer step (e.g., by 
employing a proton-driven spin diffusion (PDSD) transfer step^^, 
or DAR/RADD mixing). These experiments are referred to as either 
NCACX or NCOCX (where CX denotes 'any' carbon) The assign- 
ment procedure is then to identify the same side chain resonance 
pattern correlated to its own backbone ^^N (NCACX) and the back- 
bone ^^N of the next residue in the sequence (NCOCX). These experi- 
ments are best recorded as 3D data to lift overlap between side-chain 
patterns by the additional ^^N frequency. In this respect, the NCOCX 
correlation is more dispersed than the NCACX correlation, since the 
amide ^^N and the ^^C signals do not cluster in fingerprint regions as 
is the case for the NCA correlations. In Figure 4, several 2D strips 
extracted from a 3D NCACX (red contours) and a 3D NCOCX (cyan 
contours) are shown. Both 3D spectra were recorded with 35 ms 
DARK mixing following the adiabatic ^^N-^^C transfer to exchange 
polarization between backbone and side chains. The strips illustrate 
the assignment procedure. In strip A from the NCACX experiment, a 
threonine spin system is identified based on its fingerprint pattern; 
the threonine can either be T30 or T57, as we have only two threo- 
nines in the sequence. The 3D NCOCX spectrum is scanned for the 
same pattern of side chain signals and at ^^N= 122.7 ppm, a similar 
cross peak pattern is found (shown in strip B). In the next step, a strip 
is extracted at ^^N = 122.7 ppm from the 3D NCACX experiment 
(strip C). In this strip, a cross peak pattern is found that is attributed 



to an arginine rather than to an alanine, from which the sequential 
connection can be established between T30 and R31 rather than 
between T57 and A58. To confirm this assignment, the next residue 
should be a valine (V32), of which the nitrogen frequency can be 
found in the NCOCX experiment at 121.0 ppm (strip D). Indeed, the 
cross peak pattern observed at a ^^N frequency of 121.0 ppm in the 
NCACX spectrum is highly characteristic for a valine (cf. strip E) and 
thus confirms the sequential assignments made. By searching for the 
V32 side chain signals in the NCOCX, the ^^N shift of D33 is found at 
121.6 ppm (cf. strip F). This procedure is continued and the stretch 
of amino acids from T30 to K34 is assigned as shown in strips A-J in 
Figure 4. 

Assignment using selective transfers. In cases of repetitive spin 
pairs or when cross peaks appear in the more congested regions of 
the spectra, multiple assignment possibilities may exist. In order to be 
able to assign residues unambiguously, additional spectra were 
recorded. For example, a helpful addition to the dataset was a 3D 
NCACB spectrum, where selective exchange between ^^Ca and ^^CP 
was achieved with a DREAM transfer scheme^^'^^. In this spectrum, 
Ca-CP correlations of Ala, Val, Lys, Leu and Asp could be 
unambiguously resolved from overlap with correlations from other 
residues. Along the same lines, two 2D ^^C correlation spectra were 
recorded, employing a DREAM transfer scheme optimized for the 
Cot-CP and CP-Cy spectral regions (Supplementary Fig. S2 online). 
The advantage of the DREAM experiment is that it can be set up so 
that it predominantly provides transfers between spins that are 
directly connected; in our experience, transfer events over two 
bonds occur less frequently and are mainly observed for spin pairs 
where at least one of the two carbon atoms is only weakly coupled to 
^H. Examples are two -bond couplings involving carbons in methyl 
groups (like leucine C5i-C52, leucine CP-C5i/2, valine Cyi-Cy2 and 
isoleucine CP-C5) or carbons in flexible side chains (like lysine Cs- 
Cy and arginine C5-Cp). Such occasional two-bond transfers can be 
easily distinguished from one-bond transfers since both have 
opposite sign in cross -peak intensity. Note that in highly congested 
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Figure 4 | Strips extracted from 3D NCACX and 3D NCOCX experiments, illustrating the sequential correlation strategy. The 3D experiments were 
recorded at a field of 600 MHz, a MAS frequency of 10 kHz and at a temperature of 275 K. Initial ^^N polarization was created with a standard ^H-^^N CP 
(2 ms, 75-100% ramp on ^H), followed by selective transfer from ^^N to ^^C or ^^Ca using an adiabatic CP. A PDSD (proton- driven spin diffusion) 
sequence of 35 ms was applied to exchange magnetization between the ^^C spins. 



regions of the spectrum a partial cancellation of cross peaks may 
occur. In the 2D Coc-CP DREAM spectrum, correlations between 
spins that are more than one bond away are virtually suppressed 
(Supplementary Fig. S2 online). This is instrumental to distinguish 
Cot-CP cross peaks of valines, lysines, arginines and glutamic acids 
from two-bond Ca-Cy correlations that appear in the same spectral 
region (e.g. from leucines, lysines and arginines). By optimizing the 
DREAM transfer around a more upfield frequency, one-bond trans- 
fers, mainly between ^^Cp and ^^Cy carbons, can be distinguished 
from two-bond transfers involving methyl groups (Supplementary 
Fig. S3 online). This is particularly useful for distinguishing Cy-C5i/2 
and C5i-C52 resonances of leucines. 

Consistenq^ check of the sequence specific assignment. In 

addition, we recorded 3D NCACX and 3D NCOCX with longer 
mixing times (100 and 200 ms PDSD transfer). Controversially, we 
found that such spectra helped to remove ambiguity, even though the 
amount of cross peaks is higher. The explanation for this is, whereas 
in NCACX with short CC mixing times magnetization is only 
exchanged intra-residually ('forward transfer'), with longer mixing 
times the ^^Ca also exchanges with the backbone and side-chain 
spins of the previous residue in the sequence ('backward transfer'); 
this provides a great means for checking the self-consistency of the 
sequential assignment without need to revert to the intermedia- 
te ^^N/ and ^^C'/.i shifts. In complete analogy, NCOCX spectra with 
longer CC mixing times do not only provide backward transfer 
between ^^C and the side chain of the same residue, but also 
forward transfer, directly linking ^^C'/.i with the side chain of 
residue /. An example of multiple transfer pathways mutually 
linking the pair S44-L45 is shown in Figure 5. In the top part of 
the figure, magnetization transfer events are shown schematically. 
Red arrows represent the transfer of ^^C polarization within a spin 
system, blue arrows the exchange between residues, using the ^^Cot 
(left) or ^^C (right) as starting point. The sequential assignment of 
the residue pair S44 and L45 is shown, aided by inter-residue 



correlations. In the NCACX experiment recorded with long PDSD 
mixing time of 200 ms, polarization can be transferred from ^^Ca of 
residue / to residue /-I. This is shown schematically in the top -left 
part of Figure 5 where blue arrows indicate the backward transfer 
from Cot of L45 to C, Ca and CP of S44 (labelled with 3 and 4). The 
corresponding cross peaks are labelled accordingly in strip B. In 
contrast, in the NCACX experiment recorded with a short mixing 
time of 35 ms, only intra-residue cross-peaks for L45 are observed 
(strip A). Likewise, in the NCOCX experiment with long PDSD 
mixing, polarization can be exchanged between ^^C'/.i and residue 
/ (forward transfer. Fig. 5 top right, blue arrows). This is demon- 
strated in strip D which shows both intra-residue (labelled with 5) 
and inter-residue correlations (labelled 6 and 7). Interestingly, in the 
NCOCX obtained with a short mixing of 35 ms, apart from strong 
intra-residue cross peaks, weak transfer from S44 C to L45 Ca can be 
observed (cf. strip C). 

Finally and along the same lines, a further powerful check for 
the consistency of the sequence- specific assignment is achieved by 
analysis of direct correlations between the ^^Ca spins. Also here, 
this transfer does not involve the intermediate backbone amide and 
carbonyl ^^C spins; hence, any potential mistake made during the 
alignment procedure of strips as described above would lead to incon- 
sistent results. Sequential Ca-Ca correlations are readily observed in 
^^C-^^C spin diffusion spectra with extended DARR or PDSD mixing 
times (typically 100-200 ms). As an example, the sequential walk 
connecting the A82-A90 subsequence via the Ca's is shown in Sup- 
plementary Figure S3 online. Since the relevant information is 
obtained from a relatively narrow spectral region, these experiments 
perfectly lend themselves for /-decoupUng in the indirect dimension^^. 
Plots of the Ca-Ca region in a standard and /-decoupled DARR 
spectrum are included in Supplementary Figure S3 online. 

REDOR and methyl filtered schemes. Prolines are difficult to assign 
on the basis of 2D or 3D NCACX experiments because the lacking 
amide proton renders the first ^H- ^^N CP very inefficient. In the 2D 
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Figure 5 | Schematic representation of magnetization transfer (top); red arrows represent intra-residue transfer of ^^C polarization, blue arrows 
inter-residue ^^C exchange, starting from ^^Ca (left) or ^^C (right). 2D strips extracted from 3D NCACX (strips A and B) and 3D NCOCX spectra 
(strips C and D, bottom); the 3D spectra were recorded with proton-driven spin diffusion mixing of 35 ms (strips A and C) and 200 ms (strips B and D). 
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peaks indicated in (a) that define the topology of the p-strands are highlighted in red ("H"). The pi strand from the adjacent monomer is shown (in blue) 
to be able to indicate correlations between strands pi and p4. (c) Ribbon model of the YadA-M monomer*^. 



NCA experiment shown above, only a relatively weak cross peak 
with ^^Cot is observable for P48 (cf. Fig. 3b); however, in the 2D 
NCO experiment (Fig. 3a), and in the 2D or 3D NCACX and 
NCOCX experiments (data not shown), no side-chain correlations 
involving P48 were observed. Prolines provide, however, a highly 
characteristic correlation pattern in 2D ^^N-^^C TEDOR experi- 
ments^^'^^, consisting of pair-wise cross peaks involving the Cot and 
C5 in the a-region with the downfield-shifted backbone amide. As 
shown in Figure 6, a proline fingerprint pattern appears at a ^^N shift 
of 132.2 ppm, which is assigned to P48. Note that the other proline in 
the sequence (P9) resides in the mobile strep -tag and is indeed not 
observed in Figure 6. 

The last experiment in the dataset was a methyl-filtered post-C7 
experiment. Methyl filtering can be achieved spectroscopically by a 
straightforward combination of cross-polarization with phase inver- 
sion (CPPI) and a dipolar dephasing step^°. The filter selects carbon 
atoms that are relatively weakly coupled to ^H, primarily targeting at 
the methyl- carbons, but also non-protonated carbonyls and carbons 
in flexible side chains. In combination with the Post-C7^\ the spec- 
trum is highly instrumental for the assignment and/or corroboration 
of the assignment of methyl- containing amino acids (Ala, He, Leu, 
Val, and Thr). More specifically, it helped to unambiguously identify 
T57, which has an unusual upfield- shifted ^^Ca; as a result, the Cot- 
CP cross peak of T57 is found in the serine Ca-CP fingerprint region 
and as such was not identified straightforwardly (cf Fig. 2 and Sup- 
plementary Fig. S4 online). Additionally, in the carbonyl part of Post- 
C7, CP-Cy correlations for Asn, Asp and Cy-C5 correlations for Gin, 
Glu can be readily identified. 

Topology of beta sheet. The backbone ^^C and ^^N chemical shifts 
obtained from the sequential assignment can be assessed to predict 
secondary structure elements. For this, we used the program 



TALOS+'l According to the TALOS+ prediction, YadA-M 
consists of an N-terminal a-helix (residues H16-L45) and four P- 
strands (Pl: K53-Y63; P2: S66-R76; P3: V81-G91; P4: V95-E104) 
connected by relatively tight beta-turns. The residues F46-G52 that 
connect the helix with pi are predicted as an unstructured loop re- 
gion. In addition to the TALOS+ predictions, we recorded CHHC 
experiments to obtain interstrand correlations. The interstrand 
distance between protons on adjacent alpha carbons in an anti- 
parallel P-sheet is approximately 2.3A. We recorded 2D CHHC 
spectra using various mixing times and found several correlations 
that could be unambiguously assigned as interstrand cross peaks 
(Fig. 7a). Taking into account that the loops connecting the P- 
strands are fairly short, correlations between pi-P2, P2-P3 and P3- 
P4 are considered intramolecular. The P-sheet topology consisting 
of a 4 -stranded, antiparallel p-sheet which is in agreement with 
TALOS+ predictions and the observed inter-strand cross peaks is 
shown schematically in Figure 7b and as ribbon model in Figure 7c. 
We also observed cross peaks between pi and P4. In principle these 
correlations could be intramolecular, but this would result in a highly 
unlikely narrow beta-barrel with perpendicular P-strands. Rather, we 
attribute correlations between strands 1 and 4 to intermolecular 
interactions between the protomers forming the trimer. The rela- 
tive position of strand 4 of one monomer with respect to strand 1 
of a second monomer defines the shear number of the closed P- 
barrel. For more details and the full structure calculations, see 
reference^. 

Discussion 

We report the solid-state MAS NMR resonance assignment of a 
medium-sized trimeric membrane protein, YadA-M. From evalu- 
ation of the dataset as described above, we could unambiguously 
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assign all residues in the subsequence 14-104 of YadA-M (BMRB 
entry 18108). The residues that we were unable to assign are those in 
the strep-tag. It has been shown by subtilisin treatment that the strep- 
tag is extremely mobile and renders fair degree of flexibility to the N- 
terminal residues^. As a consequence, most of the residues in the 
strep-tag were not observed in any of the spectra of our dataset, while 
some others gave very weak and incomplete cross peak patterns. For 
instance, as noted above, only one strong proline signal set was 
detected, whereas the proline that resides in the strep -tag was not 
observed (cf. Fig. 6). In addition, in ^H-^^C INEPT spectra, signals are 
observed that cannot be attributed to the detergent but can be tenta- 
tively assigned to the strep-tag residues (Supplementary Fig. S5 
online). The assigned residues in the subsequence 14-104 can com- 
pletely account for the cross peaks detected in the various spectra of 
the dataset; the sequential assignment is highly self-consistent (e.g., 
cf Fig. 5) and no strong cross peaks were 'left over' as unassigned, 
which strongly supports our initial presumption that the observable 
signals predominantly arise from the rigid part of the protein. Also 
for the rigid part, we found that for several regions cross peaks 
appeared systematically weaker, suggesting a higher degree of (local) 
mobility. Among these are the subsequence K53-V54-N55, the 
region around A3 7 and the C -terminus (W105). Although a tentative 
assignment for W105 was possible, the sequential link was poor and 
we do not consider W 105 as part of the assigned residues. 

Despite the highly repetitive primary sequence of YadA-M, several 
unique spin systems and a fair number of pair-wise unique spin- 
systems were found that served as starting points of the assignment 
(cf. Fig. lb). We found that glycines and alanines, even though they 
are relatively abundant in YadA-M, proved very useful for sequential 
assignment. Their backbone ^^Ca and ^^N are readily identifiable 
from 2D NCA type spectra (cf Fig. 3) and can be used as sequential 
linkers in a parallel evaluation of 3D NCACX and NCOCX spectra. 
Serine residues, on the other hand, turned out to be less helpful; 
although serines are generally easily identifiable by virtue of their 
downfield CP chemical shift, six out of the 12 serines in YadA-M 
form SS pairs, which made it difficult to link them sequentially. 

At all stages of the assignment procedure, the choice of mixing 
time played a crucial role for deriving the required information. In 
the initial steps of the chemical shift assignment, identification of 
spin systems required relatively short mixing times, typically 10- 
20 ms for PDSD, to ensure predominantly intra-residue spectral 
cross peaks. For establishing sequential links, a PDSD or DARR 
mixing scheme with 100 ms proved to be sufficient. Still longer mix- 
ing times (200 ms) were useful in a final stage of the assignment 
process to cross check the assignment for consistency. Selective 
transfers using DREAM mixing in 3D NCACB, 2D CAGE and 2D 
CBCG helped identification of several spin systems by reducing the 
information content of spectra. Amino-acid residues with methyl 
groups i.e., Ala, Leu, Iso, Val and Thr were assigned/confirmed with 
the help of a double -quantum methyl filtered Post-C7 experiment. In 
the carbonyl part of the Post-C7 experiment, presence of CP-Cy 
correlations for Asn, Asp and Cy-C5 correlations for Gin, Glu made 
this spectrum extremely useful. Aromatic amino-acid residues gen- 
erally show relatively weak intra- and inter-residue cross-peak 
intensities owing to their aromatic rings which can act as magnet- 
ization sink^^. However, they provide highly characteristic cross 
peaks which allow their unambiguous assignment. Phenylalanines 
were identified on the basis of their aromatic Cy cross peaks around 
140-142 ppm. Tyrosines and arginines show their cross peaks 
around 159 ppm. In turn, tyrosines can be distinguished from argi- 
nines by virtue of their aromatic Csl and Cs2 cross peaks (116- 
117 ppm). 

YadA is a trimeric autotransporter protein. Its transmembrane 
domain is homologous to other transmembrane p -barrel proteins 
from Gram-negative bacteria^^. This large and diverse family of pro- 
teins has some conserved features also seen in YadA: it consists of an 



even number of amphipathic, antiparallel p-strands, and both ends 
of each strand are frequently made from aromatic residues that are 
known to interact with the water-lipid interface to anchor the pro- 
teins in the lipid membrane^^. In contrast to most other P -barrel 
proteins where the barrel structure is produced from a single poly- 
peptide chain, YadA is a homotrimer where all three subunits con- 
tribute four P-strands to the barrel. 

In summary, we report the complete solid-state MAS NMR res- 
onance assignment of a medium- sized trimeric membrane protein, 
YadA-M, based on a single, uniformly ^^C, ^^N-labelled, microcrys- 
talline preparation. The assignment was achieved on basis of a data- 
set that consisted of several homo- and heteronuclear MAS NMR 
correlation spectra. We were able to unambiguously assign the resi- 
dues 14-104 of YadA-M, which covers the entire membrane anchor 
domain. The residues that we were unable to assign are those in the 
strep-tag and the C-terminal residue. Since the strep-tag is not part of 
the native protein and has no physiological function, we can con- 
clude that, except for W 105, we were able to assign the entire physio- 
logically relevant part of the anchor domain. The full chemical- shift 
assignment forms an important requirement and first step towards 
the structure calculation of YadA-M^. The micro -crystalline material 
used for our studies essentially resulted as "byproduct" from our 
(non- successful) crystallization efforts. This is a scenario that is 
not uncommon in structural biology, since membrane proteins are 
generally difficult to crystallize and it can well happen that, despite 
many efforts, at the end of the day at best poor-reflecting crystals or 
micro- crystals are obtained; such preparations are not suitable for X- 
ray diffraction studies, but still form perfect samples for solid-state 
NMR studies. Hence, failure of sample preparation for one technique 
should not be the end of the road but rather can provide invaluable 
samples for other techniques, and we believe that the current work 
could trigger many future solid-state MAS NMR investigations and 
offer new perspectives in structural biology, especially for membrane 
proteins where poor crystal quality is the rule rather than the excep- 
tion. 

Methods 

Sample preparation. YadA-M was expressed in a full medium with ^^C/^^N 
(BioExpress, Cambridge Isotope Laboratories), using the expression, detergent 
extraction and purification procedure described in^. The large-scale protein 
crystallization was done by dialysis to remove excess detergent, as described in more 
detail in^. 

NMR spectroscopy. All solid-state MAS NMR experiments were performed at 275 K 
using samples from a single ^^C, ^^N uniformly labelled YadA-M batch preparation. 
Data were obtained on 400, 600 and 900 MHz AVANCE spectrometers (Bruker, 
Karlsruhe, Germany). Spectrometers were equipped with double (^H/^^C) and triple- 
resonance (^H/^^C/^^N) CP/MAS probes (Bruker). The temperature was set to 275 K. 
Except otherwise mentioned, magnetization transfer from to the ^^C or ^^N spins 
was achieved with ramped cross-polarization (CP)^^'^^; the ramp was kept relatively 
shallow (typically between 75 and 100% of the spin-lock field strength). Typical 
recycle delays between individual transients were kept between 2.7-3.0 s, to avoid 
sample heating. High-power proton decoupling with radio -frequency (rf) field 
strengths of 75-90 kHz using the TPPM^*^ or SPINAL-64 scheme^'' was applied during 
evolution and detection periods. Typical ^H and ^^C nil pulse lengths were 3.0 and 3- 
4 |is, respectively, typical ^^N nil pulses were longer, 6-7 |is. 

Homonuclear 2D ^^C-^^C correlation spectra with dipolar-assisted rotational res- 
onance (RAD/DARR)^°'^^ proton-driven spin diffusion (PDSD)^*-^° mixing schemes 
and DREAM mixing^^-^*^ were recorded at 900 MHz. For the RAD experiments, the 
following conditions were used: experiments were obtained at 10 kHz MAS; initial 
^^C magnetization was created using a 3.0 ms ramped CP (75-100%) with average rf- 
field strengths of 63 and 53 KHz, for ^H and ^^C, respectively. Mixing times of 15, 25, 
50 and 100 ms were used, in order to probe residue specific and sequential transfers in 
different distance ranges. A 3.0 s recycle delay was used. Acquisition times were 
typically 18.0 and 9.6 ms in ^2 and ti, respectively. The PDSD experiments were 
recorded under similar conditions, but with a CP contact of 1.0 ms (75-100% ramp 
on carbons), and acquisition times in the direct and indirect dimensions of 15.0 and 
4.0 ms, respectively. PDSD mixing periods ranged between 15 ms and 100 ms. The 
2D ^^C-^^C DREAM spectra were obtained at 13 kHz MAS; a CP contact of 1 ms (75- 
100% ramp on ^^C) was used. During the DREAM mixing period of 1.5 ms, the carrier 
was placed either between the Ca and C(3 regions (at —50 ppm, Ca-CP spectrum) or 
between the CP and Cy regions (at —30 ppm, cp-Cy spectrum). Acquisition times in 
the direct and indirect dimensions were 15.0 and 8.2 ms. The experiments were 
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recorded with 1024 increments, 48 scans per increment and a recycle delay of 3.0 s, 
yielding a total experimental time of 41 hours per spectrum. 

2D NCA and NCO experiments were recorded at 400 MHz, using a MAS fre- 
quency of 8 kHz. Initial magnetization was created using a 1 ms CP transfer from ^H 
to ^^N (75-100% on ^^N). Selective transfer from ^^N to ^^C was achieved using an 
adiabatic CP transfer of 4 ms. The effective acquisition times were 20 ms and 12.8 ms 
for ^^C and ^^N, respectively. In total 128 increments were recorded in the ^^N 
dimension, with 64 scans per increment. Using a recycle delay of 3.0 s, the experi- 
mental time per spectrum was about 7 hours. 

3D NCACX and 3D NCOCX'^ experiments were recorded at 600 MHz, at a MAS 
frequency of 10 kHz. A CP contact of 2 ms was employed, with 60 and 43 kHz rf- field 
on ^H and ^^N, respectively, with a 75-100% ramp on protons. Polarization was 
transferred from ^^N to ^^Ca or ^^C' with an adiabatic CP contact of 4 ms, with rf- 
fields of 35 and 25 kHz on ^'C and ^'N, respectively. For the NCACX spectra, PDSD 
mixing times of 35, 100, 200 and 500 ms were used; for the NCOCX spectra, PDSD 
mixing times of 35 and 200 ms were applied. The 3D spectra were obtained as data 
matrices of 1372 x 48 x 32, with effective acquisition times of 12 ms, 5.2 ms and 
6.4 ms for ^^C, ^^Ca/^^C' and ^^N, respectively. For each increment, 104 scans were 
averaged with a 2.8 s recycle delay. Hence, the total experimental time for each 3D 
spectrum amounted 124 hours. The 3D NCACB experiment was recorded under 
similar conditions, however with a DREAM mixing scheme of 3 ms to selectively 
exchange between Ca and CP^^'^*^. The NCACB experiment was recorded as data 
matrix of 2484 x 64 x 48, with effective acquisition times of 20.0, 5.1 and 5.4 ms for 
^^C(3, ^^Cd and ^^N, respectively. A recycle delay of 3 s was used, and 56 scans were 
recorded per increment, leading to a total acquisition time of 144 hours. 

The ^^N-^^C TEDOR experiment (transferred echo double resonance) was 
recorded at a field of 9.4 T, at 8 kHz MAS and at a temperature of 275 K. The two 
REDOR (rotational echo double resonance) mixing periods were 1.0 ms each. 
Carbon and nitrogen 180° pulses were 6.2 \is and 13.4 |is, respectively. Initial ^^C 
magnetization was created with a 2.5 ms ramped ^H-^^C cross polarization (75-100% 
ramp on the ^^C channel). During mixing and acquisition, ^H-heteronuclar 
decoupling was applied with a TPPM scheme^*^, at a moderately high ^H rf field 
strength of 75 kHz. The spectrum was recorded with 224 scans, 128 increments and 
effective acquisition times of 22 ms and 16 ms, in the ^^C and ^^N dimensions, 
respectively. With a recycle delay of 2.8 s, the experimental time for the 2D experi- 
ment amounted 23 hours. 

^^C-^^C Methyl-filtered Post-C7 experiments were obtained at 900 MHz, at a 
spinning frequency of 8 kHz. Magnetization is prepared by using a methyl filter^"; for 
this, a long ^H-^^C cross polarization (CP) contact of 2 ms is directly followed by a 
short polarization inversion (PI) of 70 |j,s, achieved with a 180° phase change of the ^H 
spin-lock pulse (CPPl)^^. A polarization inversion period of 60 |is was found to be 
sufficient for 'depolarizing' non-methyl, protonated nuclei^^'^'^. Following this pre- 
paration, a post-C7 scheme was used to obtain a 2D ^^C-^^C correlation spectrum 
(2Q/1Q)^\ Acquisition time in the direct dimension was 12.0 ms, in the indirect (2Q) 
dimension 280 experiments with an increment of 32 |j,s were recorded. 

2D CHHC^^ spectra were recorded at 700 MHz. The initial polarization was 
transferred from ^H to ^^C using 1.5 ms CP with average rf- nutation frequencies of 60 
and 50 kHz, on protons and carbons, respectively. SPlNAL-64 decoupling with an rf- 
field strength of 85 kHz was applied during evolution and detection periods. 
Acquistion times in the direct and indirect dimensions were 12 and 7 ms with a 3.0 s 
recycle delay between individual transients. The second and third rectangular CP 
contacts were kept short at 60 \is. The ^H-^H mixing times used were 35, 50, 80, 150, 
200, 300 and 500 |is. 
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